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The severe acute respiratory syndrome coronavirus (SARS-CoV) nonstructural proteins nspl to nspl6 have 
been implicated by genetic analysis in the assembly of a functional replication/transcription complex. We 
report the crystal structure of nsplO from SARS-CoV at 2.1-A resolution. The nsplO structure has a novel fold, 
and 12 identical subunits assemble to form a unique spherical dodecameric architecture. Two zinc fingers have 
been identified from the nsplO monomer structure with the sequence motifs C-(X) 2 -C-(X) 5 -H-(X) 6 -C and 
C-(X) 2 -C-(X) 7 -C-(X)-C. The nsplO crystal structure is the first of a new class of zinc finger protein three- 
dimensional structures to be revealed experimentally. The zinc finger sequence motifs are conserved among all 
three coronavirus antigenic groups, implicating an essential function for nsplO in all coronaviruses. Based on 
the structure, we propose that nsplO is a transcription factor for coronavirus replication/transcription. 


Coronaviruses (CoVs) are positive-stranded RNA viruses 
known to cause highly prevalent respiratory and enteric dis¬ 
eases in humans and animals. About two-thirds of their ge¬ 
nomes encode the viral replicase that mediates viral RNA 
synthesis (19). The replicase gene is comprised of two open 
reading frames at the 5' end of the genome, termed ORFla 
and ORFlb (26). The upstream ORFla encodes polyprotein 
ppla (450 to 500 kDa), while ORFla and ORFlb together 
encode pplab (750 to 800 kDa). Expression of the ORFlb- 
encoded half of pplab requires a —1 ribosomal frameshift 
during translation. The polyproteins ppla and pplab undergo 
extensive proteolytic processing by viral proteases to produce 
16 functional subunits known as nonstructural proteins, which 
then assemble to form the replicase complex required for viral 
replication and transcription. 

Prior to the global severe acute respiratory syndrome 
(SARS) outbreak in 2003, scant attention was paid to corona¬ 
viruses by researchers, as this genus of viruses causes severe 
diseases predominantly in animals and only comparatively mild 
diseases in humans. In the wake of the SARS outbreak, greater 
attention has been focused on the replicase proteins with a 
view to understanding the replication/transcription machinery 
and to identify new therapeutic targets. To date, the three- 
dimensional structures of a series of nsp proteins have been 
reported (1). nsp5, also called the main protease or 3C-like 
protease, was the first SARS protein structure determined in 
2003 (24) and has since been the focus of concerted efforts for 
the design of antiviral inhibitors. Last year, a broad-spectrum 
inhibitor was reported with efficient in vitro inactivation of 
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multiple coronavirus main proteases, potent antiviral activity, 
and extremely low cellular toxicity (23). The structure of nsp9 
was determined in 2004, and it was found to be a single- 
stranded RNA binding protein (4, 17). More recently, the 
complex structure between nsp7 and nsp8 revealed a hexa- 
decameric assembly that should constitute a processivity factor 
for nspl2, an RNA-dependent RNA polymerase (25). Also 
determined recently was the ADP ribose 1-phosphatase do¬ 
main of nsp3 (14). 

nsplO and nspll immediately precede nspl2 in the pplab 
sequence and have been implicated in RNA synthesis, while 
the mechanism remains to be illustrated through both struc¬ 
tural and functional studies. To improve our understanding of 
functions of CoV nsp proteins, we carried out systematic struc¬ 
tural studies on this group of proteins. We report the crystal 
structure of nsplO. This protein appears to have a novel fold, 
featuring two zinc fingers with C-(X) 2 -C-(X) 5 -H-(X) 6 -C and 
C-(X) 2 -C-(X) 7 -C-(X)-C motifs. Furthermore, 12 identical sub¬ 
units assemble into a novel spherical dodecameric architecture 
which is proposed to be a functional form of nsplO. 


MATERIALS AND METHODS 

Protein expression, purification, and characterization. The coding sequence 
for SARS-CoV nsplO-nspll was amplified by PCR from the SARS-CoV BJ01 
strain (corresponding to Ala4231 to Val4382 of the ORFla polyprotein) and 
inserted into pGEX-6p-l plasmid DNA (Amersham Biosciences) using BamHI 
and Xhol sites. The forward and reverse PCR primers used for amplification 
were F'-nsplO-nspll (5' -CGGGATCCGCTGGAAATGCTACAGAAGT-3') 
and R'-nsplO-nspll (5'-CGCTCGAGTTACACCGCAAACCCGTTTA-3'), re¬ 
spectively. The protein was expressed in Escherichia coli strain BL21(DE3) as a 
glutathione 5-transferase (GST) fusion protein and purified using glutathione 
affinity chromatography. The GST tag was removed by PreScission protease 
(Amersham Biosciences), leading to five additional residues (GPLGS) at the N 
terminus. The protein was further purified by a G25 desalting column in 2 mM 
dithiothreitol (DTT) and concentrated to 10 mg ml -1 in preparation for crys¬ 
tallization screening. 
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TABLE 1. Data collection and refinement statistics 


Statistical element 

nsplO 

nsplO in the absence of zinc 

Data collection statistics 

Cell parameters 

a = 159.1 A, b = 321.8 A, c = 161.9 A, 
a = (3 = y = 90° 

a = 159.7 A, b = 322.7 A, c = 162.2 A, 
a = (3 = y = 90° 

Space group 

C222j 

C222 1 

Resolution (A) 

50 (2.2) c -2.1 

50 (2.8)-2.7 

No. of all reflections 

1,545,407 

845,526 

No. of unique reflections 

276,259 

114,838 

Completeness (%) 

99.5 (99.4) 

100.0 (99.9) 

Average I/cr(I) 

10.5 (4.2) 

6.6 (3.8) 

Emerge* (%) 

6.1 (36.3) 

10.4 (40.1) 

Refinement statistics 

No. of reflections used (cr(F) > 0) 

468,890 

249,037 

V (%) 

21.9 

21.6 

«fre/ (%) 

24.7 

25.1 

RMSD bond distance (A) 

0.013 

0.008 

RMSD bond angle (°) 

1.67 

1.79 

Average B value (A 2 ) 

31.8 

34.5 

Ramachandran plot (excluding Pro and Gly) 

Res. in most favored regions 

2,039 (87.6%) 

1,932 (83%) 

Res. in additionally allowed regions 

278 (12.0%) 

375 (16.1%) 

Res. in generously allowed regions 

0 

18 (0.8%) 


a i? mcrgc = SfAiU — </ ft >/Xfc2/ <//,>, where <I h > is the mean of the observations 4, of reflection h. 

b ^work = 2(||F p (observed) — i ? p (calculated)||)/S /^(observed). /? free indicates the R factor for a selected subset (5%) of the reflections that was not included in prior 
refinement calculations. 

c Numbers in parentheses are corresponding values for the highest resolution shell. 


Crystallization. SARS nsplO-nspll crystals were grown at 291 K by the hang¬ 
ing drop vapor diffusion method from a sodium chloride system in desalting 
buffer supplemented with 65 mM zinc chloride. The optimum brick-like crystals 
were obtained using a reservoir solution of 1.8 M NaCl, 0.1 M NaH 2 P0 4 , 0.1 M 
KH 2 P0 4 , 0.1 M DTT, and 2-(./V-morpholino)ethanesulfonic acid (MES; pH 6.5). 
A single crystal was transferred to the reservoir solution (supplemented with 5 M 
sodium formate) for 1 h prior to submersion in a cold nitrogen stream. 

Data collection and processing. The “frozen” nsplO-nspll crystal diffracted at 
up to a 2.1-A resolution. Single-wavelength anomalous dispersion (SAD) data 
using the bound zinc were collected for the native nsplO-nspll crystal at 100 K 
using an SBC2 (3,000 by 3,000) CCD detector on beamline BL19-ID at the 
Advanced Photon Source (Argonne National Laboratory). The intensity data 
were processed and scaled using the software HKL2000 (11). Data collection 
statistics are summarized in Table 1. 

Phasing, model building, and refinement. Forty-eight heavy atoms were lo¬ 
cated from a search with the program SHELX (16, 21) that were consistent with 
the solution from a self-rotation function search using the program MOLREP 
(22). Initial phases were calculated using the single-wavelength anomalous dis¬ 
persion (SAD) method with the program SOLVE (18). Twenty-two sites were 
found to have occupancy greater than 50%, and the remaining sites had occu¬ 
pancy greater than 30%. Density modification at 2.8-A resolution by solvent 
flipping was performed with the program CNS (3). 

Initial tracing of the monomer polypeptide chain was performed manually 
using the program O (7), and a starting dodecameric model was generated using 
noncrystallography symmetry (NCS) relations generated from CNS. Initial re¬ 
finement was performed using simulated annealing in CNS with very tight NCS 
restraints. During the later stages of positional refinement, restraints were re¬ 
laxed and a bulk solvent correction was applied under the guidance of R free . 
Model geometry was verified using the program PROCHECK (9). Solvent mol¬ 
ecules were located from stereochemically reasonable peaks in the cr^-weighted 
F 0 — F c difference electron density map, where F a is the observed structure 
factor amplitude and F c is the calculated structure factor amplitude. Final re¬ 
finement statistics are shown in Table 1. 

Protein structure accession numbers. Coordinates and structure factors for 
the SARS-CoV nsplO crystal structures have been deposited in the RCSB Pro¬ 
tein Data Bank (PDB) under accession numbers 2G9T (for the 2.1-A structure) 
and 2GA6 (for the 2.7-A structure). 


RESULTS AND DISCUSSION 

Overall structure. The cDNA coding for SARS-CoV nsplO- 
nspll was amplified using PCR. The coded protein consists of 
amino acid residues 4231 to 4382 of ppla, which are renum¬ 
bered 1 to 152 hereafter for convenience, and includes both 
nsplO and nspll regions. The recombinant protein was ex¬ 
pressed in E. coli as a GST fusion protein, purified, separated 
from the GST tag, and crystallized in both the presence and 
absence of additional zinc ion. Crystals grown under these two 
conditions have the same morphology, and their crystal cell 
parameters are similar to each other. Unless otherwise men¬ 
tioned, the higher resolution crystal (with supplementary zinc 
salt) will be used in the following structural description. The 
crystal form belongs to an orthogonal space group with 24 
copies of nsplO-nspll per crystallography asymmetric unit and 
a V M of 3.1 A 3 Da -1 , where V M is the Matthews’ coefficient. 
The crystal structure was determined using SAD phasing from 
the bound zinc ions. The structure model was refined at 2.1-A 
resolution to a final R work of 21.9% and R free of 24.7%. Re¬ 
finement statistics are detailed in Table 1. All 24 crystallo- 
graphically independent copies of nsplO-nspll have very sim¬ 
ilar structures, with all pair-wise Ca root mean square 
deviations (RMSD) being less than 0.5 A. Residues before 
Ala9 (including five leading residues left from the tag) and 
Pro86-Gly88 and after Serl29 could not be traced due to lack 
of interpretable electron density. Since the latter region in¬ 
cludes the entire nspll peptide, the crystallized protein is 
nsplO per se; it is called such hereafter. The absence of Cysl30 
and subsequent residues is a result of incidental cleavage, as 
indicated by (i) the reduced molecular weight band from so- 
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FIG. 1. Ribbon representation of the nsplO crystal structure. A. 
One nsplO trimer is viewed along the noncrystallography threefold axis 
from the outside of a dodecamer. The three protomers are colored in 
magenta, gold, and green. B. The remaining three trimers of the 
dodecamer are shown in the same orientation as that for panel A using 
the same color scheme. C. The same as panel B but rotated by 90°. D. 
The relationship between threefold axes. The twist angle between pairs 
of threefold axes is approximately 108°. E. The complete dodecamer 
structure of SARS-CoV nsplO. The three protomers in a trimer are 
colored in magenta, gold, and green. This figure was prepared with the 
programs Molscript (8), Bobscript (5), and Raster3D (10). 


dium dodecyl sulfate polyacrylamide gel electrophoresis anal¬ 
ysis compared with that of a freshly prepared protein sample 
and (ii) matrix-assisted laser desorption ionization-time of 
flight mass spectrometry analysis (data not shown). The cause 
of the loss of Cysl30 and its C-terminal residues remains un¬ 
clear. However, analysis of the crystal packing suggests that the 
C-terminal peptide deletion is required for the formation of 
the crystal lattice. 

The 24 independent copies of nsplO assemble into two iden¬ 
tical spherical dodecamers related by a local twofold symmetry. 
The novel dodecameric sphere possesses a tetrahedral symme¬ 
try and can be viewed as the assembly of four nsplO trimers 
(Fig. 1). The dodecamer is hollowed in the center, with an 
outer radius of 42 A and an inner radius of 18 A (Fig. 2). Two 
zinc binding sites were identified in the three-dimensional 
structure of each nsplO monomer. The first one is located in 
the N-terminal region and on the inner surface of the dodeca- 
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mer, and the second one is located at the C terminus on the 
outer surface. The missing C tail of nsplO-nspll peptide would 
also be located on the outer surface of the dodecamer. Both 
inner and outer surfaces of the dodecamer have a dominantly 
positive electrostatic potential (Fig. 2), although the calculated 
pi of nsplO is 5.8. 

The monomer fold. The full-length nsplO polypeptide has a 
mixed a/(3 fold comprised of five a-helices (al to a5), one 
3 10 -helix, and three (3-strands ((31 to (33) (Fig. 3). The mono¬ 
mer peptide model consists of residues Ala9 to Serl29 and is 
continuous, except that the region Pro86-Gly88 is missing in 
the a4-|33 connecting loop. Nevertheless, in the 2.7-A structure 
which was solved using crystals grown in the absence of addi¬ 
tional zinc, we were able to build the Pro86-Gly88 loop region 
for some nsplO monomers, suggesting that this region is flex¬ 
ible in general. The central core of the nsplO monomer is an 
antiparallel (3-sheet formed by strands (31 (residues 55 and 56), 
(32 (65 to 69), and (33 (96 to 100). The central (3-sheet is flanked 
on one side by helices a3 (residues 70 to 73) and a4 (75 to 79), 
while helices al (residues 10 to 18), a2 (23 to 32) at the N 
terminus, helix a5 (107 to 113), and the extended C-terminal 
coil shy away from the central core. Residues on the a 4 helix 
and a4-|33 loop constitute the N-terminal zinc binding site, and 
the C-terminal coil contributes to the C-terminal zinc binding 
site. A DALI (http://www.ebi.ac.uk/dali/) search indicated no 
similar match to the nsplO monomer in the current Protein 
Data Bank (PDB), suggesting a novel fold for nsplO. 

nsplO is cysteine-rich, with 13 cysteines in its sequence, 
namely Cysl7, Cys41, Cys46, Cys73, Cys74, Cys77, Cys79, 
Cys90, Cysl03, Cysll7, Cysl20, Cysl28, and Cysl30. Of these, 
Cys74, Cys77, and Cys90 form the N-terminal zinc binding site, 
while Cysll7, Cysl20, and Cysl28 chelate the C-terminal zinc 
ion (Fig. 4). Cysl30 is missing from the refined model due to 
the loss of the C-terminal peptide. Although Cysl7-Cys46 and 
Cys41-Cys73 pairs are close in the three-dimensional structure, 
with Sy-Sy distances of 5.5 A and 4.7 A, respectively, they do 
not form disulfide linkages. This observation is consistent with 
the presence of high concentrations of reducing agents in the 
crystallization reservoir. 

The trimeric assembly. Three monomers are related by a 
noncrystallographic threefold symmetry to form a trimeric sub¬ 
unit (Fig. 1A). The trimer is stabilized largely by hydrophobic 
interactions. The buried solvent-accessible surface (SAS) of an 
nsplO monomer within a trimer is about 620 A 2 (or about 9% 
of the total SAS of an nsplO monomer), with 65% contributed 
from hydrophobic atoms. Residue Val21 in the al-a2 loop of 
one monomer forms hydrophobic interactions with residues 
Val57 and Thr58 in strand (31 of an adjacent monomer; resi¬ 
dues Thrll5 and Thrll8 of one monomer also form hydro- 
phobic interactions with Thrll8 and Vail 19 of an adjacent 
monomer. Additional hydrogen bonds are formed between the 
Nc atom of Lys25 in one molecule and the O e2 atom of Glu60 
in an adjacent molecule, with a distance of 2.6 A, and between 
the main chain O atom of Pro84 in one molecule and the N£ 
atom of Lys95 in an adjacent molecule, with a distance of 3.3 
A. Furthermore, the three monomers are oriented such that 
their C-terminal zinc fingers are clustered around the threefold 
axis, with their zinc ions separated by approximately 14 A. 

Trimer-trimer interactions and the dodecamer architecture. 
The assembly of the 24 copies of nsplO into two identical 
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FIG. 2. Electrostatic potential of the nsplO crystal structure. A. The electrostatic potential of a complete dodecamer is mapped on its outer 
molecular surface. Negatively charged regions are colored in red, positively charged regions are colored in blue, and neutral regions are colored 
in gray. The C-terminal-bound zinc ions, which are located on the outer surface, are depicted as red spheres. The position of the front trimer is 
highlighted by a triangle. B. The inner surface electrostatic potential of the dodecamer is shown in the same orientation as that of panel A. C. A 
cross-section of the nsplO dodecamer to illustrate the minimum and maximum radii of the shell. D. The electrostatic potential surface of an isolated 
trimer. This figure was drawn with the program CCP4 mg (12, 13). 


dodecamers indicates that the dodecamer is a stable structural 
unit of nsplO under the crystallization conditions. The four 
nsplO trimers (named trimers 1 to 4) in a dodecamer are 
related by a tetrahedral symmetry (Fig. 1). Any combination of 
three trimers is related by a local threefold symmetry, and so is 
a combination of two trimers related by a local twofold sym¬ 
metry. The buried SAS of a trimer in the dodecamer is about 
3,040 A 2 (or about 16% of the total SAS of an nsplO trimer), 
with 69% contributed by hydrophobic atom groups. The crystal 
structure shows that helix al and residues in the a2-|31 loop 
(residues 42 to 46) play a key role in trimer-trimer interactions. 
In the first (threefold symmetry-related) interaction region, 
Leul4, Cysl7, Alal8, and Cys79 of trimer 1 form a hydropho¬ 
bic base, which is directed towards Phel9 of trimer 2. At the 
same time, Phel9 of trimer 1 forms hydrophobic interactions 
with the equivalent hydrophobic base of trimer 3, and so on. In 
the second (twofold symmetry-related) interaction region, 
Met44 and Leu45 of trimer 1 are oriented to interact directly 
with their counterparts in trimer 2; Val42 and Tyr96 of trimer 
1 interact with Tyr96 and Val42 of trimer 2, respectively. Both 
Cysl7 and Cys79 contribute to stability of the dodecamer ar¬ 
chitecture, although they do not form a disulfide bridge in the 


reduced conformation state. Following this protocol, the 12 
molecules can assemble to form a pseudododecahedron. 

The zinc ion binding sites. Zinc binding is a major structural 
feature of nsplO. Spectral analysis of the nsplO crystal during 
synchrotron data collection clearly demonstrated the presence 
of zinc, evident by a clear peak near the zinc absorption edge. 
It allows the nsplO crystal structure to be solved by the SAD 
method. Furthermore, the 2.7-A resolution structure of nsplO 
determined from crystals prepared in the absence of additional 
zinc shows the same monomer fold, dodecameric architecture, 
and conformation in the zinc binding sites, with an RMSD of 
0.4 A for all Ca atoms in a dodecamer. 

Two bound zinc ions were identified from the crystal struc¬ 
ture of nsplO with unambiguous electron density: one located 
in the N-terminal region on the inner surface of the dodecamer 
and the other one at the C-terminal region on the outer surface 
(Fig. 2 and 4). The first zinc binding site is formed by residues 
on helix a4 and the a4-|33 loop. This zinc ion is tetrahedrally 
chelated by the Sy atoms of three cysteine residues (Cys74, 
Cys77, and Cys90) and the N e2 atom of His83, which have bond 
distances of 2.3 A, 2.3 A, 2.4 A, and 2.2 A, respectively (Fig. 
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FIG. 3. nsplO monomer fold. A. Stereo diagram showing a Ca 
trace of the nsplO monomer. Two zinc ions are shown as gray spheres, 
and one chelating water molecule is shown as a smaller red sphere. B. 
Stereo ribbon diagram of the nsplO monomer with (3-strands in purple 
and a-helices in gold. Secondary structure elements are labeled. This 
view is rotated by —90° relative to that of panel A. This figure was 
prepared with the programs Molscript (8), Bobscript (5), and 
Raster3D (10). 


4A; Table 2). This binding site constitutes a CCHC-type zinc 
finger with a C-(X) 2 -C-(X) 5 -H-(X) 6 -C sequence motif. 

The second zinc ion is located at the C terminus and is also 
tetrahedrally coordinated (Fig. 4B). Three zinc-chelating cys¬ 
teine residues (Cysll7, Cysl20, and Cysl28) have distances of 
2.4 A from the zinc ion to their side chain Sy atoms. The fourth 
ligand is a water molecule, with clearly defined electron den¬ 
sity, whose distance to the zinc ion is 2.5 A. This chelating 
water molecule also interacts with Serl29 through a perfect 
hydrogen bond to form a stabilized hydrogen bond network. 
All bond lengths related to zinc binding are summarized in 
Table 2. Noteworthy is the observation that Cysl30 neighbors 
Serl29 in the nsplO sequence but did not exist in the crystal¬ 
lized protein. Otherwise, the presence of Cysl30 might suitably 
position it to chelate the zinc ion, suggesting the second zinc 
binding site should also be a zinc finger of a C-(X) 2 -C-(X) 7 - 
C-(X)-C sequence motif. This CCCC-type zinc finger is dis¬ 
tributed on the surface of the dodecamer and is relatively 
flexible in our structure compared to the CCHC-type zinc 
finger (Table 2). However, the flexibility of this region may not 
reflect the conformation of the full-length protein. 

A multiple-sequence alignment of nsplO from SARS-CoV 
with coronaviruses from groups I, II, and III of the genus 
Coronavims indicates that all seven observed zinc-chelating 
residues, plus Cysl30, are strictly conserved (Fig. 5), implying 
their importance in the functional replicase-transcriptase com¬ 
plex. In contrast, a number of other cysteine residues in SARS 
nsplO are not conserved at all. 



FIG. 4. Stereo views of the two independent zinc binding sites. A. 
The N-terminal zinc binding site with electron density. Key residues 
interacting with the zinc ion are labeled and depicted as sticks with 
carbon, nitrogen, oxygen, and sulfur atoms colored in yellow, blue, red, 
and green, respectively. The zinc ion is shown as a gray sphere. A 
refined-model-phased 2 F a -F c electron density map is shown for the 
zinc ion and chelated residues at 1.3 a. Secondary structure elements 
are labeled. B. The C-terminal zinc binding site with electron density. 
The turn between C117 and C120 is labeled. This figure was prepared 
with the programs Molscript (8), Bobscript (5), and Raster3D (10). 


Functional implications. The finding of zinc binding sites in 
nsplO provides a strong implication for nsplO biological func¬ 
tions. The zinc finger is a common DNA binding motif found 
as part of transcription-regulatory proteins, which can often 
include more than one zinc finger in the same peptide chain. 
Moreover, increasing evidence suggests that zinc fingers can 
more widely be used to recognize RNA or even in protein- 
protein recognition. 

Although the DALI search did not find any candidate struc¬ 
ture similar to SARS-CoV nsplO from PDB, a PFAM search 
(http://www.sanger.ac.uk/Software/Pfam) for similar sequence 
motifs identified several members of the HIT-type zinc finger 
family as nsplO homologous candidates. Named after the first 
protein that originally defined the domain, the yeast HIT1 


TABLE 2. Zinc chelation 


Zinc ion 

Chelating residue 

Chelating atom 

Bond length" (A) 

Zn 1 

Cys74 

s. 

2.29 ± 0.04 


Cys77 

s. 

2.27 ± 0.04 


His83 

N e2 

2.18 ± 0.05 


Cys90 

s. 

2.40 ± 0.03 

Zn 2 

Cysll7 

S. 

2.41 ± 0.05 


Cysl20 

S. 

2.45 ± 0.15 


Cysl28 

S. 

2.38 ± 0.11 


Wat997 

H,0 

2.51 ± 0.07 

Serl29 

Wat997 

h 2 o 

2.76 ± 0.16 

“The 

listed bond length is an 

average value over the 

24 monomers with 

standard deviations. 
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IBV 1 


SARS 
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HCOV-229R 70 
NKV 70 

IBV 71 




SARS 130 9 LMQ. . 

HCoV-2291 13S 0 • 

HHV I3S 070 ■ - ■ 

IBV I«0 7KSSVQ 

FIG. 5. Multiple-sequence alignment of SARS-CoV nsplO with representatives from all three groups of the genus Coronavirus. HCOV-229E, 
human coronavirus strain 229E; MHV, mouse hepatitis virus; IBV, avian infectious bronchiolitis virus. The secondary structure for SARS-CoV is 
shown at the top of the alignment; arrows indicate pi-strands, and helical curves denote a- or 3 10 -helices. Residues highlighted in red are identical 
among the compared proteins; residues highlighted in yellow are conserved. Residues important for zinc binding are marked with green triangles, 
and residues important for stability of the dodecamer are marked with blue vertical arrows. The alignment was generated by ClustalX (20) and 
drawn with ESPript (6). 


protein, the HIT-type zinc finger contains seven conserved 
cysteines and one histidine that can potentially coordinate two 
zinc atoms. While the function of the HIT-type zinc finger is 
unknown, this motif is mainly found in nuclear proteins in¬ 
volved in gene regulation and chromatin remodeling. To date, 
there are no three-dimensional structures of HIT-type zinc 
finger domains reported in the PDB. Therefore, our nsplO 
crystal structure serves as the first example of a three-dimen¬ 
sional structure of this novel class of double zinc finger-con¬ 
taining motif. 

This finding provides strong structural evidence that, like 
other better studied nsp proteins, nsplO also likely plays a role 
in RNA synthesis, as suggested by other researchers (26). 
nsplO is involved in network interactions with other nsp pro¬ 
teins, and the integrity of its zinc fingers seems important for 
such interactions. Experiments on mouse hepatitis virus 
(MHV), a group II coronavirus along with SARS-CoV, dem¬ 
onstrated the colocalization of nsplO with nsp7, nsp8, and 
nsp9, providing solid evidence for their interaction in the coro¬ 
navirus life cycle (2). Our chemical cross-linking experiment 
further demonstrated that SARS-CoV nsplO can be cross- 
linked with nsp9 (data not shown), which itself interacts with 
nsp8 (17). Furthermore, an MHV ts mutant, Alb ts6, encoding 
a mutant form of nsplO with a Gln65-to-Glu mutation, was 
shown to have a defect in negative-strand RNA synthesis (15). 
The Gln65 residue, conserved in all three groups of the genus 
Coronavirus, is located on strand (32 of the SARS nsplO struc¬ 
ture and hydrogen bonds via the N e2 atom to the main chain 
carbonyl oxygen of Gly52. Gln65 is thus important for the 
conformational stability of nsplO and particularly for the a4 
helix which forms part of the N-terminal CCHC zinc finger. 
Therefore, mutation of Gln65 might be expected to perturb the 
folding of ppla into a less productive conformation that would 
prevent it from participating in the formation of a replicase- 
transcriptase complex with negative-strand activity. 

Conclusions. The scientific significance of the SARS-CoV 
nsplO structure is at least threefold. First, nsplO has a novel 
protein fold. A search with the DALI web engine (http://www 
.ebi.ac.uk/dali/) for structural homologs failed to yield any 
match to the nsplO fold, suggesting a novel function for the 


nsplO family members. Second, nsplO possesses two zinc fin¬ 
gers, located in the N-terminal region and at the C-terminal 
region, with C-(X) 2 -C-(X) 5 -H-(X) 6 -C and C-(X) 2 -C-(X) 7 -C- 
(X)-C sequence motifs, respectively. These motifs are con¬ 
served in all three groups of the genus Coronavirus, and our 
crystal structure illustrates for the first time the significance of 
these conserved residues. Further sequence analysis suggests 
that nsplO is related to the HIT-type zinc finger family, which 
is often found in nuclear proteins involved in gene regulation 
and chromatin remodeling. Thus, our nsplO crystal structure 
becomes the first of a new class of zinc finger protein three- 
dimensional structures to be revealed experimentally. Third, 
the molecular assembly of nsplO is a hollow dodecamer with an 
outer diameter of 84 A and an inner diameter of 36 A. Twelve 
C-terminal zinc fingers stick out from the outer surface of the 
sphere, and another 12 zinc fingers are distributed around the 
inner surface. The strong positive electrostatic potential found 
on both the inner and outer surfaces of the dodecamer is 
intriguing, consistent with the probable function of nsplO in 
the RNA synthesis machinery. 

To date, the structures and functions of several components 
of the replication/transcription machinery have been deter¬ 
mined, including the nsp3 ADP ribose 1-phosphatase domain 
(14); nsp5 (23, 24), nsp7, and nsp8 in complex (25); and nsp9 
(4, 17). nsp5 is the main protease for cleavage of the replicase 
polyproteins, nsp7 and nsp8 are proposed to function as pro- 
cessivity factors for nspl2 (the RNA-dependent RNA poly¬ 
merase), and nsp9 is a single-strand RNA binding protein. The 
crystal structure reported here will help to clarify the func¬ 
tion^) of nsplO, in which the presence of two zinc fingers 
should enable it to play an important role in RNA synthesis. 
Elucidation of the nsplO structure will provide further insights 
into the sophisticated replication/transcription mechanism of 
SARS-CoV and other coronaviruses, such as mouse hepatitis 
virus (MHV), human coronavirus strain 229E, and human 
coronavirus strain HKU1. 
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